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Abstract. We settle the question of tight thresholds for offline cuckoo hashing. The problem can be 
stated as follows; we have n keys to be hashed into m buckets each capable of holding a single key. 
Each key has fc > 3 (distinct) associated buckets chosen uniformly at random and independently of the 
choices of other keys. A hash table can be constructed successfully if each key can be placed into one 
of its buckets. We seek thresholds Ck such that, as n goes to infinity, if n/m < c for some c < Ck then 
a hash table can be constructed successfully with high probability, and if n/m > c for some c > a 
hash table cannot be constructed successfully with high probability. Here we are considering the offline 
version of the problem, where all keys and hash values are given, so the problem is equivalent to previous 
models of multiple-choice hashing. We find the thresholds for all values of A; > 2 by showing that they 
are in fact the same as the previously known thresholds for the random fc-XORSAT problem. We then 
extend these results to the setting where keys can have differing number of choices, and provide evidence 
in the form of an algorithm for a conjecture extending this result to cuckoo hash tables that store multiple 
keys in a bucket. 

1 Introduction 

Consider a hashing scheme with n keys to be hashed into m buckets each capable of holding a single key. 
Each key has fc > 3 (distinct) associated buckets chosen uniformly at random and independently of the 
choices of other keys. A hash table can be constructed successfully if each key can be placed into one 
of its buckets. This setting describes the offline load balancing problem corresponding to multiple choice 
hashing [1] and cuckoo hashing [14,26] with fc > 3 choices. An open question in the literature (see, for 
example, the discussion in [24]) is to determine a tight threshold such that if n/m < c for some c < Ck 
then a hash table can be constructed successfully with high probability, and if n/m > c for some c > 
Ck a hash table cannot be constructed successfully with high probability. In this paper, we provide these 
thresholds. 

We note that, in parallel with this work, two other papers have similarly provided means for determining 
the thresholds [13,15]. Our work differs from these works in substantial ways. Perhaps the most substantial 
is our argument that, somewhat surprisingly, the thresholds we seek were actually essentially already known. 
We show that tight thresholds follow from known results in the literature, and in fact correspond exactly to 
the known thresholds for the random fc-XORSAT problem. We describe the fc-XORSAT problem and the 
means for computing its thresholds in more detail in the following sections. Our argument is somewhat 
indirect, although all of the arguments appear to rely intrinsically on the analysis of corresponding random 
hypergraphs, and hence the alternative arguments of [13,15] provide additional insight that may prove useful 
in further explorations. 
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With this starting point, we extend our study of the cuckoo hashing problem in two ways. First, we con- 
sider irregular cuckoo hashing, where the number of choices corresponding to a key is not a fixed constant 
k but itself a random variable depending on the key. Our motivations for studying this variant include past 
work on irregular low-density parity-check codes [20] and recent work on alternative hashing schemes that 
have been said to behave like cuckoo hashing with "3.5 choices" [18]. Beyond finding thresholds, we show 
how to optimize irregular cuckoo hashing schemes with a specified average number of choices per key; for 
example, with an average of 3.5 choices per key, the optimal scheme is the natural one where half of the keys 
obtain 3 choices, and the other half obtain 4. Second, we consider the generalization to the setting where a 
bucket can hold more than one key. We provide a conjecture regarding the appropriate threshold behavior 
for this setting, and provide a simple algorithm that, experimentally, appears to perform remarkably close to 
the thresholds predicted by our conjecture. 

Section 2 presents an exposition of known results on cores of random hypergraphs. Readers familiar 
with this material may want to skip directly to Section 3, which provides our proof that the thresholds for 
/j-XORSAT and k-ary cuckoo hashing are identical. In Section 4 we extend the discussion of thresholds 
to the case where k is any real number greater than 2. Finally, Section 5 presents our simple algorithm to 
construct hash tables and presents experimental evidence that it is able to achieve load factors close to the 
thresholds. Further details appear in the appendices. 

2 Technical background on cores 

The key to our analysis will be the behavior of cores in random hypergraphs. We therefore begin by providing 
a review of this subject. To be clear, the results of this section are not new; the reader is encouraged to see [23, 
Ch. 18], as well as references [7,10,25] for more background. 

We consider the set of all /c -uniform hypergraphs with m nodes and n hyperedges Q^^^ „. More precisely, 
each hypergraph G from Q^^ „ consists of n (labeled) hyperedges of a fixed size fc > 2, chosen independently 
at random, with repetition, from the (™) subsets of {1, ... , m} of size k. This model will be regarded as a 
probability space. We always assume k is fixed, m is sufficiently large, and n = cm for a constant c. 

For £ > 2, the £-core of a hypergraph G is defined as the largest induced sub-hypergraph that has mini- 
mum degree £ or larger. It is well known that the ^-core can be obtained by the following iterative "peeling 
process": While there are nodes with degree smaller than £, delete them and their incident hyperedges. By 
pursuing this process backwards one sees that the ^-core, conditioned on the number of nodes and hyper- 
edges it contains, is a uniform random hypergraph that satisfies the degree constraint. 

The fate of a fixed node a after a fixed number of h iterations of the peeling procedure is determined 
by the /i-neighborhood of a, where the /i-neighborhood of a is the sub-hypergraph induced on the nodes at 
distance at most h from a. For example, the 1 -neighborhood contains all hyperedges containing a. In our 
setting where n is linear in m the ^.-neighborhood of node a is a hypertree of low degree (at most log log m) 
with high probability. We assume this in the discussion to come. 

We can see whether a node a is removed from the hypergraph in the course of h iterations of the peeling 
process in the following way. Consider the hypertree rooted from a (so the children are nodes that share a 
hyperedge with a, and similarly the children of a node share a hyperedge with that node down the tree). First, 
consider the nodes at distance h—1 from a and delete them if they have at most £—2 child hyperedges; that 
is, their degree is at most £ — 1. Second, treat the nodes at distance /i — 2 in the same way, and so on, down 
to distance 1, the children of a. Finally, a is deleted if its degree is at most £ — 1. 

The analysis of such random processes on trees has been well-studied in the literature. (See, for example, 
[4,19] for similar analyses.) We wish to determine the probability that node a is deleted after h rounds of 
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the peeling process. For j < h let pj be the probability that a node at distance h — j from a is deleted after j 
rounds of the peeling process. The discussion becomes easier for the binomial random hypergraph with an 
expected number of cm hyperedges: Each hyperedge is present with probability fc! • c/m''^^ independently. 
It is well known that Q^^ „ and the binomial hypergraph are equivalent as far as asymptotic behavior of cores 
are concerned when c is a constant. 

Let Bin(iV, p) denote a random variable with a binomial distribution, and Po(/3) a random variable with 
a Poisson distribution. Below we make use of the Poisson approximation of the binomial distribution and 
the fact that the number of child hyperedges of a node in the hypertree asymptotically follows the binomial 
distribution. This results in additive terms that tend to zero as m goes to infinity. We have po — 0, 
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Pr[Po(fcc) < ^ - 2] ±o(l), 



Pj+i = Pr 



Bin 



TO — 1 
fc- 1 



fc! • 



c 



'^"i) < ^ - 2] ±o(l), for j = 1 



<e-2 



= Pr[Po(fcc(l -pj < ^ - 2] ±o(l), for j = 1,... 2. 
The probability qh that a itself is deleted is given by the following different formula: 

qh - Pr[Po(fcc(l - Ph-l)''-^) < ^ - 1] ± o(l). 



(1) 



The Pj are monotonically increasing and < Pj < l,so p — limp^ is well-defined. The probability that 
a is deleted approaches p from below as h grows. Continuity of the functions involved implies that p is the 
smallest non-negative solution of 

p = Pr[Po(fcc(l-p)'=-^) <£-2]. 

Observe that 1 is always a solution. Equivalently, applying the monotone function 1 1— > fcc(l — t)'^"^ to both 
sides of the equation, p is the smallest solution of 



fcc(l - p)"-^ = fee (1 - Pr[Po(fcc(l - pf-^) <£-2 



. fc-i 



(2) 



Let /3 = fcc(l — p)'^^^. It is helpful to think of /3 with the following interpretation: 

Given a node in the hypertree, the number of child hyperedges (before deletion) follows the distribution 

Po(fcc). Asymptotically, a given child hyperedge is not deleted with probability (1— p)*^^^, independently for 

all children. Hence the number of child hyperedges after deletion follows the distribution Po(fcc(l —p)''^^). 

And /3 is the key parameter for the node giving the expected number of hyperedges containing it that could 

contribute to keeping it in the core. 
Note that (2) is equivalent to 
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This motivates considering the function 



(Pr[Po(/?) >£-!]) 
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fc ' (Pr[Po(^) > ^ - 
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which has the following properties in the range (0, oo): It tends to infinity for /3 — > 0, as well as for /? — > oo. 
Since it is convex there is exactly one global minimum. Let PI ^ = arg min^ gk,i{P) and ^ = min gk^iiP). 
For f3 > (51 ^ the function gk,i is monotonically increasing. For each c > ^ let /3(c) = $k.e{c) denote the 
unique f3 > f3'^ ^ such that gk,i{P) — c. 

Coming back to the fate of a under the peeling process. Equation (1) shows that a is deleted with prob- 
ability approaching Pr[Po(/3(c)) <£—!]. This probability is smaller than 1 if and only if c > which 
implies that the expected number of nodes that are not deleted is linear in n. As the /i-neighborhoods of 
two nodes a and h are disjoint with high probability, by making use of the second moment we can show 
that in this case a linear number of nodes survive with high probability. (The sophisticated reader would use 
Azuma's inequality to obtain concentration bounds.) 

Following this line of reasoning, we obtain the following results, the full proof of which is in [25]. (See 
also the related argument of [23, Ch. 18].) Note the restriction to the case k + £ > 4, which means that the 
result does not apply to 2-cores in standard graphs; since the analysis of standard cuckoo hashing is simple, 
using direct arguments, this case is ignored in the analysis henceforth. 

Proposition 1. Let k + i > 4 and G be a random hype rgraph from „. Then ^ is the threshold for the 
appearance of an £-core in G. That is, for constant c and m — oo, 

(a) if n/m — c < then G has an empty £-core with probability 1 — o(l). 

(b) if n/m — c > c^. ^, then G has an £-core of linear size with probability 1 — o(l). 

In the following we assume c > ^. Therefore /3(c) > i exists. Let to be the number of nodes in the 
£-core and n be the number of hyperedges in the ^-core. We will find it useful in what follows to consider the 
edge density of the £-core, which is simply the ratio of the number of hyperedges to the number of nodes. 

Proposition 2. Let c > c*^ ^ and n/m — c{l zt o(l)). Then with high probability in Q^^ „ 

TO = Pr[Po(/3(c)) > £] • TO ± o(to) and n = (Pr[Po(^(c)) >£- 1])'= • n ± o(to). 

The bound for m follows from the concentration of the expected number of nodes surviving when we 
plug in the limit p for ph in equation (1). The result for h follows similar lines: Consider a fixed hyperedge 
e that we assume is present in the random hypergraph. For each node of this hyperedge we consider its 
/i-neighborhood modified in that e itself does not belong to this /i-neighborhood. We have k disjoint trees 
with high probability. Therefore each of the k nodes of e survives h iterations of the peeling procedure 
independently with probability Pr[Po(/3(c)) >£—!]■ Note that we use £ — 1 here (instead of i) because 
the nodes belong to e. Then e itself survives with (Pr[Po(/3(c)) > £~ 1])*^. Concentration of the number of 
surviving hyperedges again follows from second moment calculations or Azuma's inequality. 

With this we have the information needed regarding the edge density of the ^-core. 

Proposition 3. If c > c^. ^ and n/m — c{l± o(l)) then with high probability the edge density of the £-core 
of a random hypergraph from „ is 

/3(c)-Pr[Po(/3(c))>^-l] 

fc • Pr[Po(/3(c)) > ^] ^ 

This follows directly from Proposition 2, where we have also used equation (3) to simplify the expression 
for n. 

We define Ck,e as the unique c that satisfies 

/3(c).Pr[Po(/3(c))>^-l] ^ 

k ■ Pr[Po(^(c)) >£] ' ^ ' 
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The values Ck,i will prove important in the work to come; in particular, we next show that c/c 2 is the threshold 
for fc-ary cuckoo hashing for k > 2. We also conjecture that c^^^+i is the threshold for fc-ary cuckoo hashing 
when a bucket can hold £ keys instead of a single key. 

The following table contains numerical values of Cfc ^ for £ = 2, ... ,7 and k — 2, ... ,7 (rounded to 10 
decimal places). Some of these numbers are found or referred to in other works, such as [7, Sect. 5], [22, 
Sect. 4.4], [23, p. 423], [12], and [5]. 



1.7940237365 
2.8774628058 
3.9214790971 
4.9477568093 
5.9644362395 



0.9179352767 
1.9764028279 
2.9918572178 
3.9970126256 
4.9988732941 
5.9995688805 



0.9767701649 
1.9964829679 
2.9993854302 
3.9998882644 
4.9999793407 
5.9999961417 



0.9924383913 
1.9994487201 
2.9999554360 
3.9999962949 
4.9999996871 
5.9999999733 



0.9973795528 
1.9999137473 
2.9999969384 
3.9999998884 
4.9999999959 
5.9999999998 



0.9990637588 
1.9999866878 
2.9999997987 
3.9999999969 
5.0000000000 
6.0000000000 



3 Equality of thresholds for random fc-XORSAT and fc-ary cuckoo hashing 

We now recall the random fc-XORSAT problem and describe its relationship to cores of random hypergraphs 
and cuckoo hashing. The fc-XORSAT problem is a variant of the satisfiability problem in which every clause 
has fc literals and the clause is satisfied if the XOR of values of the literals is 1 . Equivalently, since XORs 
correspond to addition modulo 2, and the negation of Xi is just 1 XOR Xi, an instance of the fc-XORSAT 
problem corresponds to a system of linear equations modulo 2, with each equation having fc variables (none 
of which is negated), and randomly chosen right hand sides. (In what follows we simply use the addition 
operator where it is understood we are working modulo 2 from context.) 

For a random fc-XORSAT problem, let <?j!^ „ be the set of all sequences of n linear equations over m 
variables xi, . . . , x,n, where an equation is 

^ji + ■ ■ ■ + Xj^ — bj , 

where bj G {0, 1} and {ji, . . . ,jk} is a subset of {1, . . . ,m} with fc elements. We consider „ as a 
probability space with the uniform distribution. 

Given a fc-XORSAT formula F, it is clear that F is satisfiable if and only if the formula obtained from 
F by repeatedly deleting variables that occur only once (and equations containing them) is satisfiable. Now 
consider the fc-XORSAT formula as a hypergraph, with nodes representing variables and hyperedges repre- 
senting equations. (The values bj of the equations are not represented.) The process of repeatedly deleting all 
variables that occur only once, and the corresponding equations, is exactly equivalent to the peeling process 
on the hypergraph. Hence, after the peeling process, we obtain the 2-core of the hypergraph. 

This motivates the following definition. Let i^'j^j „ be the set of all sequences of n equations such that 
each variable appears at least twice. We consider If",^ „ as a probability space with the uniform distribution. 

Recall that if we start with a uniformly chosen random fc-XORSAT formula, and perform the peeling 
process, then conditioned on the remaining number of equations and variables (n and m), we are in fact left 
with a uniform random formula from tf/^ -j. Hence, the imperative question is when a random formula from 
'^A n ^ill satisfiable. In [10], it was shown that this depends entirely on the edge density of the corre- 
sponding hypergraph. If the edge density is smaller than 1, so that there are more variables than equations, 
the formula is likely to be satisfiable, and naturally, if there are more equations than variables, the formula 
is likely to be unsatisfiable. Specifically, we have the following theorem from [10]. 
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Theorem 1. Let k > 2 be fixed. For n/m = 7 and m 00, 

(a) if-f > 1 then a random formula from „ is unsatisfiable with high probability. 

(b) if^ < 1 then a random formula from If"^ „ is satisfiable with high probability. 

The proof of Theorem 1 in Section 3 of [10] uses a first moment method argument for the simple direction 
(part (a)). Part (b) is significantly more complicated, and is based on the second moment method. Essentially 
the same problem has also arisen in coding theoretic settings; analysis and techniques can be found in for 
example [21]. It has been suggested by various readers of earlier drafts of this paper that previous proofs of 
Theorem 1 have been insufficiently complete, particularly for fc > 3. We therefore provide a detailed proof 
in Appendix C for completeness. 

We have shown that the edge density is concentrated around a specific value depending on the initial 
ratio c of hyperedges (equations) to nodes (variables). Let Ck,2 be the value of c such that the resulting 
edge density is concentrated around 1. Then Proposition 3 and Theorem 1 together with the preceding 
consideration implies: 

Corollary 1. Let k > 2 and consider 'P'^^ „ . The satisfiability threshold with respect to the edge density 
c — n/m is Ck,2- 

Again, up to this point, everything we have stated was known from previous work. We now provide the 
connection to cuckoo hashing, to show that we obtain the same threshold values for the success of cuckoo 
hashing. That is, we argue the following: 

Theorem 2. For k > 2, Ck^2 is the threshold for k-ary cuckoo hashing to work. That is, and with n keys to 
be stored and m buckets, with c = n/m fixed and m — 00, 

(a) if c > Cfc_2. then k-ary cuckoo hashing does not work with high probability. 

(b) if c < Cfc.2, then k-ary cuckoo hashing works with high probability. 

Proof : Assume a set of n keys S is given, and for each x E S a random set C {1, . . . , m} of size k of 
possible buckets is chosen. 

To prove part (a), note that the sets for x E S can be represented by a random hypergraph from „. 
If n/m = c > cj, 2 and m — !• 00, then with high probability the edge density in the 2-core is greater than 1. 
The hyperedges in the 2-core correspond to a set of keys, and the nodes in the 2-core to the buckets available 
for these keys. Obviously, then, cuckoo hashing does not work. 

To prove part (b), consider the case where n/m — c < 2 and m — > 00. Picking for each x a 
random £ {0, 1}, the sets A^, x £ S, induce a random system of equations from „. Specifically, 
Ax = {ji, ■ ■ ■ ,jk} induces the equation Xj-^ + ■ ■ ■ + Xj^ — h^. 

By Corollary 1 a random system of equations from (^J;;^ „ is satisfiable with high probability. This implies 
that the the matrix M made up from the left-hand sides of these equations consists of linearly independent 
rows with high probability. This is because a given set of left-hand sides with dependent rows is only satis- 
fiable with probability at most 1 /2 when we pick the hx at random. 

Therefore we have an n x n-submatrix in M with a nonzero determinant. The expansion of the de- 
terminant of this submatrix as a sum of products by the Leibniz formula must contain a product with all 
factors being variables Xi - (as opposed to 0). This product term corresponds to a permutation mapping keys 
to buckets, showing that cuckoo hashing is indeed possible. □ 

We make some additional remarks. We note that the idea of using the rank of the key-bucket matrix to 
obtain lower bounds on the cuckoo hashing threshold is not new either; it appears in [11]. There the authors 
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use a result bounding the rank by Calkin [6] to obtain a lower bound on the threshold, but this bound is not 
tight in this context. More details can be found by reviewing [6, Theorem 1.2] and [23, Exercise 18.6]. Also, 
Batu et al. [3] note that 2-core thresholds provide an upper bound on the threshold for cuckoo hashing, but 
fail to note the connection to work on the fc-XORSAT problems. 

4 Non-integer choices 

The analysis of fc-cores in Section 3 and the correspondence to fc-XORSAT problems extends nicely to the 
setting where the number of choices for a key is not necessarily a fixed number fc. This can be naturally 
accomplished in the following way: when a key x is to be inserted in the cuckoo hash table, the number of 
choices of location for the key is itself determined by some hash function; then the appropriate number of 
choices for each key x can also be found when performing a lookup. Hence, it is possible to ask about for 
example cuckoo hashing with 3.5 choices, by which we would mean an average of 3.5 choices. Similarly, 
even if we decide to have an average of fc choices per key, for an integer fc, it is not immediately obvious 
whether the success probability in fc-ary cuckoo hashing could be improved if we do not fix the number of 
possible positions for a key but rather choose it at random from a cleverly selected distribution. 

Let us consider a more general setting where for each x E U the set is chosen uniformly at random 
from the set of all fc^. -element subsets of [m], where follows some probability mass function on 
{2, . . . , m}.' Let = E{kx) and i<* — Tlxes ^'^^^ '^hat n* is the average (over all x E S) worst 
case lookup time for successful searches. We keep n* fixed and study which sequence {px)xes maximizes 
the probability that cuckoo hashing is successful. 

We fix the sequence of the expected number of choices per key {nx)xes and therefore k*. Furthermore 
we assume < n — 2, for all x E S\ obviously this does not exclude interesting cases. For compactness 
reasons, there is a system of probability mass functions that maximizes the success probability. We will 
show the following: 

Proposition 4. Let {px)xe s '^^ optimal sequence. Then we have, for all x e S*.- 

Px{Vi^x\) = 1- {kx - Vhx\), and Px{Ikx\ + 1) = Kx - [kx\. 

That is, the success probability is maximized if for each x E S the number of choices kx is concentrated 
on [kx\ and Ikx\ + 1 (when the number of choices is non-integral). Further, in the natural case where all 
keys X have the same expected number k* of choices, the optimal assignment is concentrated on [k* \ and 
[k* \ + 1. Also, if is an integer, then a fixed deg optimal. This is very different from other 

similar scenarios, such as erasure- and error-correcting codes, where irregular distributions have proven 
beneficial [20]. 

The proof is given in Appendix A. 

4.1 Thresholds for non-integral degree distributions 

We now describe how to extend our previous analysis to derive thresholds for the case of a non-integral 
number of choices per key; equivalently, we are making use of thresholds for XORSAT problems with an 
irregular number of literals per clause. 

' We could in principle also consider the possibility of keys having only a single choice. However, this is generally not 
very interesting since even a small number of keys with a single choice would make an assignment impossible whp., 
by the birthday paradox. Hence, we restrict our attention to at least two choices. 
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Following notation that is frequently used in the coding literature, we let Ak be the probabiUty that a key 
obtains k choices, and define A{x) — J^k ^kx''. Clearly, then, A'{x) = J^k ^kkx'^~^, and ^'(1) = k*. 
(We assume henceforth that Ao ~ Ai = and yl^ = for all k sufficiently large for technical convenience.) 

We now follow our previous analysis from Section 2; to see if a node a is deleted after h rounds of the 
peeling process, we let pj be the probability that a node at distance h — j from a is deleted after j rounds. 
We must now account for the differing degrees of hyperedges. Here, the appropriate asymptotics is given by 
a mixture of binomial hypergraphs, with each hyperedge of degree k present with probability fc! • cAk/m''~^ 
independently. 

The corresponding equations are then given by po ~ 0, 



Pi = Pr 



Pr 



TO — 1 
fc- 1 



fc 

Po(fccyifc) <e~2 



k\-^\<e~2 

TO*^ i 



±o(l), 



Pr[Po(cyl'(l)) < ^ - 2] ±o(l), 



Pj+i = Pr 



Pr 



^Po(fccylfe(l -pjO'^^i) <£-2 



±o(l), for j = l,...,/i-2, 



= Pr[Po(cyl'(l -pj)) <£-2]± o(l), for j = 1, . . . , /i - 2. 

Note that we have used the standard fact that the sum of Poisson random variables is itself Poisson, which 
allows us to conveniently express everything in terms of the generating function A{x) and its derivative. As 
before we find p = limpj, which is now given by the smallest non-negative solution of 

p = Pr[Po(cyl'(l -p)) <e-2]. 

WTien given a degree distribution {Ak)k, we can proceed as before to find the threshold load that allows 
that the edge density of the 2-core remains greater than 1 ; using the approach of Appendix C, this can again 
be shown to be the required property for the corresponding XORSAT problem to have a solution, and hence 
for there to be a permutation successfully mapping keys to buckets. Notice that this argument works for all 
degree distributions (subject to the restrictions given above), but in particular we have already shown that the 
optimal thresholds are to be found by the simple degree distributions that have all weight on two values, [k* J 
and [k* \ + 1. Abusing notation slightly, let Ck. .2 be the unique c such that the edge density of the 2-core of 
the corresponding mixture is equal to 1, following the same form as in Proposition 3 and equation (4). The 
corresponding extension to Theorem 2 is the following: 

Theorem 3. For k* > 2, c^,' ,2 is the threshold for cuckoo hashing with an average of k* choices per key to 
work. That is, with n keys to be stored and m buckets, with c = n/m fixed and m — > 00, 

(a) if c > Ck* .2, for any distribution on the number of choices per key with mean n* , cuckoo hashing does 
not work with high probability. 

(b) if c < Ck*,2. then cuckoo hashing works with high probability when the distribution on the number of 
choices per key is given by px ( Ynx\ ) = 1 — {>^x — L'^^ J ) <^^d px ( [kxJ + 1) = — [_'^x\ , for all x G S. 
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Fig. 1. Thresholds for non-integral K*-ary cuckoo hashing, with optimal degree distribution. The values in 
the tables are rounded to the nearest multiple of 10^^''. 

We have determined the thresholds numerically for a range of values of k* . The results are shown in 
Figure 1. One somewhat surprising finding is that the threshold for n* < 2.25 appears to simply be given 
by c = 0.5/(3 — K*). Consequently, in place of using 2 hash functions per key, simply by using a mix of 2 
or 3 hash functions for a key, we can increase the space utilization by adding 33% more keys with the same 
(asymptotic) amount of memory. 

5 Algorithm for computing a placement 

In this section, we describe an algorithm for finding a placement for the keys using /c-ary cuckoo hashing 
when the set S of keys is given an advance. The algorithm is an adaptation of the "selfless algorithm" 
proposed by Sanders [27], for the case k = 2, and analyzed in [5], for orienting standard undirected random 
graphs so that all edges are directed and the maximum indegree of all nodes is at most £, for some fixed 
£ > 2. We generalize this algorithm to hypergraphs, including hypergraphs where hyperedges can have 
varying degrees. 

Of course, maximum matching algorithms can solve this problem perfectly. However, there are multiple 
motivations for considering our algorithms. First, it seems in preliminary experiments that the running times 
of standard matching algorithms like the Hopcroft-Karp algorithm [17] will tend to increase significantly 
as the edge density approaches the threshold (the details of this effect are not yet understood), while our 
algorithm has linear running time which does not change in the neighborhood of the threshold. This proves 
useful in our experimental evaluation of thresholds. Second, we believe that algorithms of this form may 
prove easier to analyze for some variations of the problem. 

We first describe the generalized selfless algorithm for bucket size £ = 1. A description in pseudocode 
is given as Algorithm 1. The algorithm can deal with arbitrary hypergraphs, uniform or not. The aim is 
to "orient" the hyperedges of the hypergraph G, i. e., associate a node v £ e to each hyperedge e so that 
at most one hyperedge is directed towards any one node v. Initially, all hyperedges are unoriented. Nodes 
that have an hyperedge directed towards them are saturated and are not considered further, and similarly 
hyperedges once oriented are fixed. At each step, if there is a node v that is incident to only one undirected 
hyperedge e, we direct v to e, breaking ties arbitrarily. (In the pseudocode, this is realized by giving such 
nodes the highest priority, which is 0. Note that this rule entails that the algorithm starts by carrying out the 
peeling process for the 2-core. But the rule is also applied when hyperedges from the 2-core have akeady 
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been treated.) If there are no such nodes, every unoriented hyperedge is assigned as its weight the number 
of unsaturated nodes it contains. (Intuitively, a smaller weight means a higher need to direct the hyperedge.) 
The priority of a node v then is the sum of the inverses of the weights of the hyperedges that contain v. 
This corresponds to the expected number of hyperedges v would have directed toward it if all its unoriented 
hyperedges were directed to one of their nodes at random. Now a vertex v of smallest (highest) priority is 
chosen, again breaking ties at random. If this priority is larger than 1, then the algorithm stops and reports 
"failure". This is because the sum of all priorities is the number of undirected hyperedges, so if the smallest 
priority is bigger than 1, the number of undirected hyperedges is larger than the number of unsaturated nodes, 
and it is impossible to complete the process of directing the hyperedges. Otherwise the algorithm directs the 
minimum weight incident hyperedge of v toward v, breaking ties randomly. (Intuitively, this means that the 
algorithm tries to continue the peeling process "on average".) This step is repeated until all hyperedges have 
been oriented or failure occurs. 



Algorithm 1: {k, l)-GeneraHzed Selfless 

Input: Hypergraph G = {V, E) with m nodes and n hyperedges. 

Purpose: Direct all hyperedges such that the maximum indegree is at most 1. 

// lA{v) set of undirected hyperedges incident to node v 
for t 1 to n do 

Vq-^ {v £V : \U{v)\ > 0}; Eq -i^ {e € E : e undirected}; 

forall the e G £0 do / / calculate edge weight aj(e) 
1^ u}{e) |{u G e : no hyperedge is directed towards v}\; 

forall the u G Vo do / / calculate priority ■k{v) 

it a hyperedge is directed towards v then 7r(ii) <— 2 / / saturated 

else 

it\U{v)\ = lthen7r(t;) ^ 0else7r(i;) ^ Eeeww 

find V £ Vo with smallest priority (break ties by randomization); 
if 7r(u) > 1 then return failure; 

choose minimum weight hyperedge e £U{v) (break ties by randomization); 
_ direct e towards v 



We ran the generalized selfless algorithm for hypergraphs with 10^ and 10^ nodes and tabulated the fail- 
ure rate around the theoretical threshold values Ck.2 for k = 3,4, 5. Results demonstrate that the generalized 
selfless algorithm achieves results quite near the threshold; more details and figures are given in Appendix B. 

5.1 A conjecture, with evidence from a generalized selfless algorithm 

Now consider a situation in which buckets have a capacity of ^ > 1 keys. There is as yet no rigorous analysis 
of the appropriate thresholds for cuckoo hashing for the cases k > 2 and £ > 1. However, our results of 
Section 2 suggest a natural conjecture: 

Conjecture 1. For fc-ary cuckoo hashing with bucket size l, it is conjectured that cuckoo hashing works with 
high probability if n/m = c > Ck/+i, and does not work if n/m = c < Ck,i+i, i. e., that the threshold is at 
the point where the {£ + l)-core of the cuckoo hypergraph starts having edge density larger than (!.. 
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In order to provide evidence for this conjecture, we generalize our algorithm further so that it can deal 
with bucket size £ > 1. The pseudocode is given as Algorithm 2. In hypergraph language, we are now looking 
for an orientation of the hyperedges of G so that every node has at most £ hyperedges directed toward it. 
Now a node is saturated if it has £ edges pointing to it. As long as there are nodes v such that the number of 
hyperedges directed toward v and the number of undirected hyperedges containing v taken together does not 
exceed £, one such node is chosen and its undirected edges are directed toward it. Again, the effect of this 
rule is that the algorithm starts by carrying out the peeling process that finds the {£ + l)-core. Otherwise, 
the algorithm assigned weights and priorities as before, and if all priorities exceed £, the algorithm stops and 
reports failure. If the smallest (highest) priority is at most £, a vertex of smallest priority is chosen and one 
of the incident undirected hyperedges of minimum weight is directed toward it. The process is carried out 
until all hyperedges have been directed or failure occurs. 



Algorithm 2: {k, ^) -Generalized Selfless 

Input: Hypergraph G = {V, E) with m nodes and n hyperedges. 

Purpose: Direct all hyperedges such that the maximum indegree is at most I. 

// set of hyperedges directed towards node v 

11 U{v) set of undirected hyperedges incident to node v 

for t 1 to n do 

Vo^ {v eV : \U(v)\ > 0}; Eo ^ {e e E : e undirected}; 

forall the e G £() do / / calculate edge weight oj{e) 
[_ uj{e) ^ \{v e e : \V{v)\ < £}\; 

forall the D G Vo do / / calculate priority 7r(w) 
if \U{v)\ + \V{v)\ < £ then n{v) ^ 0; 
[_ else^W^Eee^^MZJR + l^'WI; 
find V £ Vo with smallest priority (break ties by randomization); 
if 7r(i;) > £ then return failure; 

choose minimum weight hyperedge e £U{v) (break ties by randomization); 
direct e towards v 



Experiments with Algorithm 2 corroborate Conjecture 1, in that they show that the failure rate of the 
algorithm changes from to 1 very close to the possible threshold values suggested in Conjecture 1. Again, 
numerical results are given in Appendix B. 

6 Conclusion 

We have found tight thresholds for cuckoo hashing with 1 key per bucket, by showing that the thresholds are 
in fact the same for the previous studied fc-XORSAT problem. We have generalized the result to irregular 
cuckoo hashing where keys may have differing numbers of choices, and have conjectured thresholds for the 
case where buckets have size larger than 1 based on an extrapolation of our results. 

References 

1. Y. Azar, A. Broder, A. Karlin, and E. Upfal. Balanced allocations. SIAM Journal on Computing, 29(1): 180-200, 
1999. 



11 



2. R.N. Bhattacharya and R. Rao, Normal approximation and asymptotic expansions, Wiley, New York, (1976). 

3. T. Batu, R Berenbrink, and C. Cooper. Balanced allocations: Balls-into-bins revisited and chains-into-bins. CDAM 
Research Report LSE-CDAM-2007-34. 

4. A. Broder, A. Frieze, and E. Upfal. On the satisfiability and maximum satisfiability of random 3-CNF formulas. In 
Proceedings of the Fourth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 322-330, 1993. 

5. J. A. Cain, R Sanders, and N. Wormald. The random graph threshold for fc-orientiability and a fast algorithm for 
optimal multiple-choice allocation. In Proceedings of the 18th Annual ACM-SIAM Symposium on Discrete Algror- 
ithms, pp. 469-476, 2007. 

6. N. Calkin. Dependent Sets of Constant Weight Binary Vectors. Combinatorics, Probability, and Computing, 
6(3):263-271, 1997. 

7. C. Cooper. The size of the cores of a random graph with a given degree sequence. Random Structures and Algo- 
rithms, 25(4):353-375, 2004. 

8. N. Creignou and H. Daude. Smooth and sharp thresholds for random /c-XOR-CNF satisfiability. Theoretical Infor- 
matics and Applications, 37(2): 127-147, 2003. 

9. N. Creignou and H. Daude. The SAT-UNSAT transition for random constraint satisfaction problems. Discrete Math- 
ematics, 309, No 8 (2009), 2085-2099. 

10. O. Dubois and J. Handler. The 3-XORSAT threshold. In Proceedings of the 43rd Symposium on Foundations of 
Computer Science, pp. 769-778, 2002. 

11. M. Dietzfelbinger and R. Pagh. Succinct data structures for retrieval and approximate membership. In Proceedings 
of the 35th ICALP, pp. 385-396, 2008. 

12. D. Fernholz and V. Ramachandran. The fc-orientability thresholds for Gn,p. In Proceedings of the 18th Annual 
ACM-SIAM Symposium on Discrete Algrorithms, pp. 459-468, 2007. 

13. N. Fountoulakis and K. Panagiotou. Sharp load thresholds for cuckoo hashing. CoRR, abs/0910.5147, 2009. 
Submitted on 27 Oct 2009. 

14. D. Fotakis, R. Pagh, P. Sanders, and P. Spirakis. Space efficient hash tables with worst case constant access time. 
Theory of Computing Systems, 38(2):229-248, 2005. 

15. A. M. Frieze and P. Melsted. Maximum matchings in random bipartite graphs and the space utilization of cuckoo 
hashtables. CoRR, abs/0910.5535, 2009. Submitted on 29 Oct 2009 (vl), revised II Nov 2009 (v2). 

16. Brian Gough. GNU Scientific Library Reference Manual - Third Edition. Network Theory Ltd., 2009. online: 
http : //www . gnu .org/ software/xGSl /manual/. 

17. J. E. Hopcroft and R. M. Karp. An n^^^ algorithm for maximum matchings in bipartite graphs. SIAM Journal on 
Computing, 2(4):225-231, 1973. 

18. E. Lehman and R. Panigrahy. 3.5-Way cuckoo hashing for the price of 2-and-a-bit. In Proceedings of the 17th 
Annual European Symposium on Algorithms, pp. 671-681, 2009. 

19. M. Luby, M. Mitzenmacher, and M.A. Shokrollahi. Analysis of random processes via and-or tree evaluation. In 
Proceedings of the Ninth Annual ACM-SIAM Symposium on Discrete Algorithms, pp. 364-373, 1998. 

20. M. Luby, M. Mitzenmacher, M.A. Shokrollahi, and D. Spielman. Efficient erasure correcting codes. IEEE Transac- 
tions on Information Theory, 47(2):569-584, 2001. 

21. C. Measson, A. Montanari, and R. Urbanke. Maxwell construction: the hidden bridge between iterative and maxi- 
mum a posteriori decoding. IEEE Transactions on Information Theory, 54(I2):5277-5307, 2008. 

22. M. Mezard, F. Ricci-Tersenghi, and R. Zecchina. Two solutions to diluted p-spin models and XORSAT problems. 
J. Statist. Phys. 111(3/4): 505-533, 2003. 

23. M. Mezard and A. Montanari. Information, Physics, and Computation. Oxford University Press, 2009. 

24. M. Mitzenmacher. Some open questions related to cuckoo hashing. In Proceedings of the 17th Annual European 
Symposium on Algorithms, pp. 1-10, 2009. 

25. M. Molloy. Cores in random hypergraphs and Boolean formulas. Random Structures and Algorithms, 27(1):124- 
135, 2005. 

26. A. Pagh and F. Rodler. Cuckoo hashing. Journal of Algorithms, 51(2):122-144, 2004. 

27. P Sanders. Algorithms for Scalable Storage Servers. In Proceedings ofSOFSEM 2004, pp. 82-101, 2004. 



12 



A Optimality of degree distribution 



We present here the proof of Proposition 4. Specifically, we show that if {px)x(^s is an optimal sequence, 
then for all x G 5: 

Px{[nx\) = 1- {kx~ [hx\), and Px{[_K.x\ +1) ^ Kx~ [kx\. 

Proof : We consider a random bipartite graph Gs with left node set S, right node set [m] and an edge 
between two nodes x ^ S and a e [m] if and only if a G Ax- Let the sequence {K,x)xes be fixed. For each 
a; € we want to obtain a distribution px for the degree kx (or, equivalently, the cardinality of Ax), such 
that we have E{kx) = Hx and the following quantity is maximized: 

Pr("success") :— VT{{Ax)xes admits a left-perfect matching^ in Gs) ■ (5) 

We study the sequence {px)xe s that realizes the maximum. Let z be an arbitrary but fixed element of S with 
probability mass function p^. To prove Proposition 4 it is sufficient to show that if there exist two numbers I 
and k with I < < k and fc — Z > 2 as well as Pz{l) > and Pz{k) > then (5) cannot be maximal. 

We start by fixing kx and Ax for each x E S — {z} and consider the corresponding bipartite graph 
Gs-{z}- Let B C [m] be the set of right nodes in Gs^{z} that are matched in every matching. Then there is 
a matching for the whole key set S in Gs if and only if Az % B. Note that < \B\ < m, i- e., there must be 
at least one right node that is not matched. Let p — mm{pz{l), Pz{k)} > and \B\ = b. We will show that 
changing pz to 

P'zil) Pz{l) ~ P P'zik) Pz{k) - p 

p'z{l + l) ■.^Pzil + l)+P P'z{k~l) -.^ Pz{k ^ 1) + p, 

with p'zU) = Pz{j) for j ^ {/, k}, increases (5), while leaving Kz unchanged. This is the case if and only if 

, (fc) ^ ii + i) , (fe-i) 

P- 7^ +P- -Pm^ >P- ji^ +P- Ti^rx' (6) 

\l) \k) \l + l) \k-l) 

and the strict inequality holds for at least one value b that occurs with positive probability. The left sum of 
(6) is the 2 • p fraction of the failure probability (by Pz{l) and Pz{k)) before the change of pz under the 
condition that B has cardinality b; the right sum is the corresponding fraction of the failure probability after 
the change. Depending on b we have to distinguish several cases. 

Case 1: 6 = m — 1. In this case both sides of (6) are equal, i. e., the modification we do to pz will not change 
the success probability. 

Case 2: k < b < m—1. Canceling p and subtracting (^) / (™) and (i^i) / from both sides of (6) shows 
that the strict inequality holds if and only if 

b---{b-l + l) b---{b-l) ^ b---{b-k + 2) b---{b-k + l) 



TO • • • (to — / + 1) TO • • • (m — I) m • • • (m — fc + 2) to • • • (m — fc + 1) 
Factoring out ^tpT\ on the left side and ^•••(''~*'+^) qj^ j-jjg right side gives 

'm--- (rn— i + l) m- ■ ■ (m— fc+2} to 

b---{b-l + l) ^ b---{b-k + 2) ^m---{m-k + l) ^b---{b-k + 2) 
TO • • • (to — I) TO • • • (m — fc + 1) TO • • • (to — I) b ■ ■ ■ {b — I + 1) 

^ In the following "matching" and "left-perfect matching" are used synonymously. 
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Since Z < fc — 2, this is equivalent to 



{m^l + l)-{m-l)---{m^k + l)> {b~l)-{b-l-l)---{b-k + 2) , 



(9) 



which is true for to — 1 > 6. 
Case 3: I < b < k. Calculations along the lines of case 2 show that the strict inequality of (6) also holds in 



Case 4: < 6 < /. In this case both sides of (6) are zero, i. e., the modifications we do to will not change 
the success probability. 

Since in cases 1 and 4 above there was no change in the success probability, to show that (5) cannot be 
maximal when k — I > 2 as we are considering, it remains to show that at least one of the Cases 2 and 
3 occurs with positive probability. We construct a situation in which one of these cases applies, and which 
occurs with positive probability. 

Choose degrees k^ for all elements x G S — {z} such that < and Px{kx) > 0. Consider a 
permutation of the elements x e S* — {z} such that these degrees are ordered, i.e., k^^ < kx2 < • ■ ■ < 
kx„_i - Choose the first element Xi with i > I and k^- < i- Such an element must exist, since we assume 
kx < < n — 2, in particular we have / < ?i — 2. Arrange that Ax^ C [i], 1 < j < i, such that there 
is a matching in G^xi,....xi}- This implies b > I. Then arrange that \Axj — Ui<j'<j ^a:^/ 1 = 1' for 
i < j < n — 2, as well as |^2;„_i ^ Ui<j'<ji-i ^x-, \ = 2, which implies 6 < to — 1. This finishes the proof 
of Proposition 4. □ 

B Performance results for the generalized selfless algorithm 

We present some performance results for the generalized selfless algorithm We ran the generalized selfless 
algorithm for hypergraphs with lO'^ and 10^ nodes and tabulated the failure rate around the theoretical 
threshold values Ck,2 for k = 3,4, 5. For each pair (to, k) we considered 81 edge densities c = — , spaced 
apart by 0.0001, thus covering an interval of length 0.008, which encloses the theoretical threshold value for 
the particular parameter pair (m, k). The hyperedges of the hypergraphs were randomly chosen via pseudo 
random number generator MT19937 "Mersenne Twister" of the GNU Scientific Library [16]. We measured 
the average failure rate of the algorithm over 100 random hypergraphs for each combination (to, n, k) within 
the parameter space. To get an estimation of the threshold, i. e., the rate c where the algorithm switches from 
success to failure, we fit the sigmoid function 



to the measured failure rate (via gnuplot ), using the method of least squares. We determined the parameters 
a, b that lead to a (local) minimum of the sum of squares of the 81 residuals, denoted by J^res- The parameter 
a is the inflection point of (10) and therefore the approximation of the threshold of the generalized selfless 
algorithm. Figures 2, 3 and 4 show the results of the experiments. 

One observes that this simple algorithm is able to construct the placements for edge densities quite close 
to the calculated thresholds Ck.2- The slope of the sigmoid curve increases and J^res decreases with growing 
TO and k, leading to a sharp transition from total success to total failure. Clearly the algorithm can fail 
on hypergraphs that admit a matching. Experimental comparisons with a perfect matching algorithm [17] 
showed that this is very unlikely for random hypergraphs. An example is given in Figure 5, which shows 

' gnuplot, an interactive plotting program, version 4.2, http : / /www . gnuplot . info 
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Fig. 3. edge size = 4; theoretical threshold Cfe,2 ~ 0.97677 



0.9 

0.8 - 



measured data 

(l + e-C-*'/")-' 




09 - 
0,8 - 
0.7 - 
0.6 - 
05 - 
0.4 - 
03 - 
0.2 - 



measured data 

(l + e-C-"")- 



i:„, = 0.0019705 
o = 0.992441 



0.989 099 0991 0992 0.993 0.994 0995 0996 



0989 0.99 0.991 0.992 0.993 0994 0995 0.996 



(a) m = 10^ (b) m = 10® 

Fig. 4. edge size = 5; theoretical threshold Ck,2 ~ 0.99244 
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the failure rate of perfect matching in comparison to the generalized selfless algorithm. Note that the plot 
shows an interval of size 0.004, i. e., 41 data points instead of 81. The differences in the failure rates of the 
algorithms become very small as m grows. 

Similarly, we find our generalized algorithm for the case where the bucket size £ is greater than 1 has 
similar behavior. For an example see Figure 6. 
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C Proof of the threshold for fc-XORSAT 



In this section we give a full proof of the threshold for fc-XORSAT (Corollary 1). The proof employs the 
notation and facts developed in Sections 2 and 3, especially Propositions 2 and 3, and the following fact 
(known as "Friedgut's Theorem" for fc-XORSAT [8,9]). 

Fact 1. For every fc > 3 there exists a function Cfe(m) < 1 such that, for every e > and a random formula 
F (system Ax = h of equations) from „ we have the following: 



lim Pr[F is satisfiable] 



1, ifn = ck{m){l-e)m 
0, ifn — Cfc(r7i)(l + e)m 



Recall from Section 3 that <?f„ „ can be regarded as a probability space whose elements are pairs [A, b) where 
A is an n X TO matrix with entries in {0, 1}, each row containing fc I's, and h G {0, 1}". Alternatively, A 
can be regarded as a node-edge incidence matrix Aq of a fc-uniform hypergraph G € Gm n- the obvious 
correspondence we identify „ with the set of bipartite graphs G with n left nodes ("check nodes") and m 
right nodes ("variable nodes") and degree fc at each left node. Similarly, if'^'^ ^ is the probability space whose 

elements are pairs {A, b), b S {0, 1}", where A is either the incidence matrix of a fc-uniform hypergraph H 
with TO nodes, fi edges, and minimum degree 2 or the adjacency matrix Ah of a bipartite graph H with h 
left nodes and to right nodes, with degree fc at each left node and minimum degree 2 at each right node. We 
use the same notation for both and let V.'^ be the set of all these graphs."^ The following lemma is central. 

Lemma 1. For any 6 > there exists e — e{6) > such that the following happens. Let H e T-L^ ^ 
be uniformly random with n < to(1 — S) and denote by the number of solutions of the linear system 
Ahx^O (over GF[2]). Then 

Vx[Zh = 2'"""] > e{5) > . (11) 

We note that a full proof of this lemma for the special case fc = 3, with Vy[Zh = 2™^"] = 1 — o(l), was 
given in [10]. 

Proof of Corollary 1 (assuming Lemma 1): Consider the following two cases. 

(i) 2 < c < Ck.2- Let a system Aqx — bht chosen at random from „. Reducing G to its 2-core H 
leads to a system A^x = b with to variables, n equations, and rank(A/f ) = rank(A(3) — (to — to). 
The graph H is random in "H^ ^. By Propositions 2 and 3, with high probability fi < (1 — 5) to for some 
6 — S{c), and m = 0{m). By Lemma 1, for to large enough, we get Pr[Z^f = 2™^"] > e{6) > 0. 
This implies Pr[AGX = bis satisfiable] > Pr[AG has full row rank] — Pi:[Zg — 2™^"] > e{6). 

(ii) c > Ck.2- Let Aqx ~ b and its reduced version Ahx ~ be as in (i). By Propositions 2 and 3, with 
high probability ri > (1 + S)m for some 6 — S{c), and to = 0{m). We have rank(A) < to, and by the 
randomness of b we have Pi[Ahx — bis satisfiable] < 2™"" < 2"'^™. 

Combining parts (i) and (ii) with Friedgut's Theorem (Fact 1) shows that lim,„_j.oo Ck{m) — Cfe,2, which 
implies Corollary 1. □ 
We now move to the proof of Lemma 1, which focuses on the 2-core H of the graph G, and we condition 
on its number of nodes. With a slight abuse of notation we will drop the "hat" from our notations. In other 
words, we now let iJ be a uniformly random graph from "H^ „ and let -f — n/m (see Theorem 1). 



For simplicity we assume that for each left node a sequence of k right nodes is chosen at random, allowing and 
ignoring repetitions. The difference from fc-uniform hypergraphs is negligible. 
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It is convenient to introduce some additional notation. Given a formal series p(z), coeff [p(z), z"^] denotes 
the coefficient of z"^ in p{z). We further introduce the notations 

q{z) = (e^ - 1 - z) , 

It is easy to see that z Q{z) is a strictly increasing function with lim^^o Q{z) — 2, and limz_j.oo Q{z) = 
oo. Further z Pk{z) is strictly increasing with Yaa^^Q Pk{z) — 0, and lim^-^oo Pk{z) = k for k even, 
and k — 1 otherwise. 

Further we define the domain sets 



Qiz) 



zq [z) 
q{z) ' 

Pk{z) = 



zPkjz) 
Pkiz) 



(12) 
(13) 



2^m,n = {{w,l) (^I? : < w < m,2w < I < kn- 2(m -w),l even } , (14) 

V^{e)^{{u,X)eR^ : e < uj < 1 - e , + e < X < I ~ ~ e} , (15) 

V„,,n{£)^{{w,l)eV,n^^ : (^,^)el?^(e)}, (16) 

{£) ■ (17) 

The assertion of Lemma 1 now follows from the following sequence of lemmas, to be proven in the 
subsections below. 

Lemma 2. Let Zh be the number of solutions of the linear system Ahx — 0. Then 

nzH] = ^ E ^(^'0, (18) 

iVf) — 

where we define 

No = {kny. coeff[(e" - 1 - z)™, z'="] , (19) 

N{w, I) = (^2^ n{kn ~ ly. coeff[(e" - 1 - z)™, z'] coeff [(e^ - 1 - z)"'""', z*^""'] coeff[pfe(z)", z'] . 

(20) 

Lemma 3. For any S > there exists e > such that, ifn< m{l — S), then 

N{w,l)<2"^'. (21) 

(w,l)f^V^,^(e) 

Lemma 4. For any (5 > 0,e > there exists C — C{5,e) such that, if mS < n < to(1 — 6) and 
(w, I) e I?m,„(e), then 



NjwJ) C 

— < — exp 

No m 



(-'/'(^,^)), (22) 
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where, letting h{z) = — zlogz — (1 — z) log(l — z),^ we define 



ipiui, A) = h{uj) — kj h(X) — log q{s) + kj log s (23) 
+ a;logq(a) — fc7Aloga + (1 — w) logq{b) — ^7(1 — A) log 6 
+ 7 log pk (c) - fc7A log c . 

Finally, a = a{uj. A), h — b{uj, A), c — c(a;, A), and s are the unique non-negative solutions of 

Q{s)^k^, g(a) = ^, Q{h)=^-^j^^, (24) 

Pk{c)^k\. (25) 

Lemma 5. For any 7 < 1, the function ip : 'D^{0) M achieves its unique global maximum at {u, A) = 
(1/2, 1/2), with V(l/2, 1/2) = (1 - 7) log 2. 

Further, there exists ^ > such that — HesSi/,(l/2, 1/2) >z £,l2-^ 

Finally, let us recall a well known fact about lattice sums (see for instance [2]). 

Lemma 6. Let D be an open domain in W\ and F : D ^ R be continuously differentiable, achieving its 
unique maximum in z^, G D, with HessF(-z*) ^ £.Idfor some ^ > 0. Then there exists C > such that, for 
any S > 

^ exp {^F{6x)) < ^ exp (iF(z,)) . (26) 

a;eZ<* : xSeD 

Proof of Lemma 1: The proof is simply obtained by putting together Lemmas 2, 3, 4, 5 and using Lemma 
6 (with F{xi, X2) — ipixi, 2x2/(^7)), d = 2 and S = l/n) to bound the sum. □ 



C. 1 Proof of Lemma 2 

Clearly Nq is the number of graphs in „. Indeed it is the number of way of putting nk distinct balls in 
TO bins in such a way that each bin contains at least 2 balls. 

The claim follows by proving that, for each {w, I) E 2?m,ri, N{w, I) is the number of couples {H, x) 
where H S "Hj^i n ^i^^ ^ ^ {0: 1}'" with Ahx = mod 2, such that x has w ones and H has I edges 
incident on variable (right) nodes i such that Xi = 1. Indeed, (^) gives the number of ways of choosing the 
ones. Paint by red the I edges incident on these nodes, and by blue the other {kn — I) edges. The coefficient 
factors give the number of ways of attributing red/blue edges to nodes on the two sides. The factorials give 
the number of ways of matching edges of the same color on the two sides. □ 



C.2 Proof of Lemma 4 

Let us start by proving a lower bound on Nq. For any s > 0, we have 



No^ikny.^^ Pr 



m 



Xj — kn 



(27) 



log means logarithm to the base e 

Hess^ denotes the Hessian matrix of ip and I2 the 2x2 unit matrix 
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where Xi , . . . , Xm are i.i.d. Poisson random variable (with parameter s) conditioned to Xi > 2, i.e., for any 

g > 2, 

1 s« 

e'* — 1 — s ql 



Ft[X, 



(28) 



By assumption s is chosen such that Es[Xi] — Q{s) — kj ^ {kd,k{l — 6)). By the local central limit 
theorem for lattice random variables of [2, Corollary 22.3], we have Pis [X^J^i = kn\ > C j yjm for 
sone constant C"(^), whence, using Stirling's formula 



e J 



kn 



(29) 



Consider now N{w, I). By the central limit theorem for the sum of Bernoulli random variables, for any 

mS < w < m{l — S), we have 



m\ ^ C'2{6) ^„ih{w/m) 

w J ~ Jra 



Treating the coefficient terms as above, and using Stirling's formula for /, (kn — I) = 0{m), we get 



N(w,l) < ^e™'^('"/") (- 
m V e 



knl\ 
e / 



(kn-l) 



kn—l 



The claim is proved by taking the ratio of the bounds (31) and (29). 



(30) 

(31) 

□ 



C.3 Proof of Lemma 5, outline 

We now present an outline of the proof of Lemma 5. Appendix D contains the additional details for a 
complete proof. 

For (w, A) = (1/2, 1/2), Eqs. (24), (25) admit the unique solution a = h = s and c = 1. A straightfor- 
ward calculation yields -0(1/2, 1/2) = (1 — 7) log 2. 

Call X;a,b, c) the right hand side of Eq. (23). Notice that the derivatives of tf' with respect to a, b, c 
vanish by Eqs. (24), (25). Therefore it is easy to compute the partial derivatives 



dip , 1 — cj 
— = log 



log 



g(a) 



^ = -fc7log^'— - fc7log " - kjlogc. 
a\ A 



(32) 
(33) 



Using the fact that a = b = s and c = 1 at {oj, A) = (1/2, 1/2), we get that the gradient of ip vanishes at 
(1/2, 1/2), and again, 0(1/2, 1/2) = (1 - 7) log 2. 

By a somewhat longer calculation, we obtain the following second derivatives 



-4 1 



1/2,1/2 



dXduj 
duP- 



1/2,1/2 



= -4 



s2C ' 
{kl? 



1/2,1/2 



s^C 



(34) 
(35) 
(36) 
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with 

c.^^-22i£,^j>„. ,3,) 

q(s) <l(s) s 

It is easy to deduce that — Hcss^, (1/2, 1/2) is positive definite. 

The function ip : T>^{0) — > M is continuous in 1)^(0) and differentiable in its interior. Further, we have 
the following asymptotic behaviors (first two at fixed A, second two at fixed lu): 

r , r '^^ ^^c^ 

lim — — = +00 , lim — — = —oo , (3o) 

lim — = +CX1 , hm — = -oo . (39) 

A-s.2w/(fc7) aA A^l-2(1-Lj)/(A;7) aA 

Therefore any global maximum of ip must be a stationary point in the interior of ^'-^(O). We next will prove 
that (1/2, 1/2) is the only such point. 

Notice that ^{co, A; a, b, c) is convex with respect to a, b, c. As a consequence 

'ip{u!, A) = min W{uj, X;a,b,c) . (40) 
We will construct an upper bound on ^p by choosing a, b, c appropriately. The first remark is that 

^{1 -Lu,l-X;b, a, 1/c) = ^{u, A; a, b,c)-j log , , . (41) 

Since, for c e [0, 1] (which is guaranteed by Eq. (25) for A G [0, 1/2]) we have Pk{c) > c''pk{l/c), we can 
restrict without loss of generality to A < 1/2 (whence c e [0, 1]). 

Next notice that, maximizing 'F over w, we get 'P{uj, A; a, b, c) < (A; a, 6, c), where 

!Z'i(A;a,6,c) = log (g(a) + g(6)) - ^7 /i(A) - log g(s) + fc7 log s (42) 
— fc7A log a — fc7(l — A) log b + 7logpfc(c) — fc7A log c . 

Next fix c = c(A) = bX/{a — aX). Since this transformation is invertible, we can as well keep c as a free 
parameter, and let A = ac/{ac + b). If we let ^'2(0, 6, c) = !f'i(ac/ [ac + 5); a, 6, c), we get 

•Z'2(a,&,c) =log(g(a)+g(6)) -logg(s)+7logpfc(c) (43) 
— fc7 log (ac + 6) + A;7A log s . 

Also, without loss of generaUty, we can rescale a by a factor s, and set b = s, therefore defining ^'3(0, c) = 
'p2{sa, s, c). If we introduce the notation 

q(s) e** — 1 — ,s 

we get the expression 

Ma,c) = -fc7log(l + ac) + log (1 + /l,(a)) +7logpfe(c). (45) 
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By the above derivation we have the following relation with i/)(a;, A): 

V^(W,A) < 1'i{c)l^x/a,ic){l-X) - (46) 

^a{c) = •Z'3(a*(c),c) , a*(c) = argmin!f3(a, c) . (47) 

a>0 

A direct calculation shows that a*(l) = 1 and •^'3(1, 1) = (1 — 7) log 2. This point corresponds to (w, A) = 
(1/2, 1/2) through the above derivation. We will show that c = 1 is indeed the global maximum of ^'4(0) 
for c e [0, 1], which implies the assertion. 

Maximizing ^j,{a, c) with respect to c implies a*(c) to be the unique non-negative solution of the sta- 
tionarity condition 

(48) 

{I + cf-^ + {1 - c)^-^ 
On the other hand, the stationarity condition with respect to a yields 

As (a) 



l + yl,.(a)-aA,(a) 



(49) 



where we used the fact that yls(l) = and defined \s{x) = A'g{x)/ A'^{1). 

Equations (48) and (49) admit the solutions a = c = and a = c = 1, and is easy to check that these 
are both local maxima of 1^4. We will show that they admit only one more solution with c e (0,1), that 
necessarily is a local minimum of 'F4. Indeed, if we let a = tanhx, c — tanh y, Eq. (48) becomes 

x={k~l)y. (50) 

Our claim is therefore implied by Lemma 7 below. □ 



Lemma 7. For s > 0, let 



Define F, : M ^ M Z^y 



Mt) = ^-, , \s{t) = ^—. (51) 

e'* — 1 — s — 1 



F.(.) = atanh (/(tanh.)) , m ^ , ^ ^^^fi ■ (52) 

Then Fg is convex on [0, cx)). 

Proof : This can be seen simply by graphing Fg [x), or by some calculus which we omit. □ 



C.4 Proof of Lemma 3 

The proof is analogous to the one of Lemma 4. We have just to be careful to the values of w, I near the 
boundary of the domain !?„, .„. Luckily we only need a loose upper bound. Equation (29) remains true 
in the present case (as it only hinges on n — 0{m)). On the other hand using (™) < exp{mh{w/in)), 
coeS[f{x)'',x^] < f{af/a^ and m! < ^/2^ (m/e)"+i/2, we get 

N{w,l) < 27re-'="-ie""(«'/™) l'+'^'{kn - Z)('=«-'+i/2) ^ . (53) 

a 0*^" ' c 
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for any a,b,c> 0. Taking the ratio, and bounding polynomial factors ^yi{kn — I) < Cm we get 

— ^ ' ^ < C m exp (rnip(w/m, X/kn)^ , (54) 

whence 

V N{w,l) <Cm^ exp{msup{il;{uj,X) : {uj,X) £V^{0)\V^{e)}) (55) 

Nq ^ 

with ip{uj, X) defined as in Eq. (23). Notice that ip : T>^{S) — > M is a continuous function. It is therefore 
sufficient to show that it is strictly smaller than (1 — 7) log 2 on the boundaries of its domain. This indeed 
follows from Lemma 5. □ 



D Appendix D: Proof of Lemma 5 

This appendix contains the full proof of Lemma 5 of Appendix C. 

D.l Relation to Appendix C 

We use the following notation: 



Q{x) 



q{x) := exp(a;) — x — 1 
X ■ q'{x) a;(exp(a;) — 1) 



q{x) exp(a;) — a; — 1 

k 



p(x) = ^•PPLUS(.T,fc)= 



j even 



• x\ 



p{x) 

PPLUS(a;, y) = (1 + x)'" +{l-x)y,x arbiti-ary if y > integer ,l>a;>Oify>0 
PMINUS(a;, y) = (1 + x)'" - (1 - x)^, x arbiti-ary if y > integer, l>a;>Oify>0. 

Observe that Q{x) is the expectation of the integer random variable with probability of i > 2 being = 
x^ /{ilq{x)). Similarly for P{x). 
We have 

Q{x) is strictly monotonously increasing for x > 0. 

lim Q{x) — X — and Q{x) > x (56) 

a;— f 00 

lim Q{x) = 2 

P{x) is strictly monotonously increasing from to k if fc even, fc — 1 if A; odd . (57) 
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Proof that Q{x) is increasing. Simple calculus: 



1 



Q{x) X exp(x) — 1 

1 1 exT>{x) o /si 

— ^ + 7 ( \_U2 < a;2 + 2 < cxp(a;) + ^^^^^^ (58) 



dx Q{x) x"^ (exp(a;) — 1)-^ exp(a;) 

d I 1 
With several differentiations, using — — — — — 

dx exp(a;) exp(a;) 

the right-hand- side inequality of (58) follows from 2 < exp(a;) H — — 

exp(a;) 

which follows from (Gxp(a;) - if > 0. 



n = jjvariables , m = U equations, 7 := m/n, 7 < 1 , fc = jj variables per equation . 
Note: We have exchanged the meaning of n and m when compared to Appendix C. 

fc7 = Q(s(fc,7)) defines s{k,-/) > 2 (cf. (56.)) (59) 

As (3(2.0) = 2.911 . . . (and (5(2.2) 3.03 ... ) and fc > 3 the assumption s(fc,7) > 2 can be made 
without loss of generality. 



For u!,\ we assume throughout, that there exists an e > such that 

2cj , 2(1-0;) 
e<uj<l-e and — +e<A< -— ^ - e. (60) 

Condition (60) ensures that we stay away from the boundary of the domain allowed for w, A (cf. (15) of 
Appendix C.) 

Definition 1. For a,b,c,s>0 

9(0) f q{b) 



\Pi{u},a,b,s) := 



q{s)u}J \{l-u})-q{s) 



tf'(w, A, a,5, c, s) := tf'i(a;, a, fe, s) • (!f'2(A, a, 6, c, s))'*^. 

Definition 2. 

ip{Ld, X) -.^ 'F(Ld, X,a,b,c, s)) , with s = s{k,j) 

and a, b, c given by Q{a) = —, Q{b) = ~ X)^l ^ p(^^^ = X-k (Recall (56), (57) ) (61) 

m 1 — UJ 

Oberserve that (60) ensures that a, 6, c > in Definition 2. 
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Lemma 8 (Formula (40) from Appendix C). 



niin if'fa;, A, a, 6, c, s 



The point left unfinished in Appendix C is a full proof of the following theorem. 

Theorem 4 (Lemma 5 in Appendix C). For 7 < 1 '>jj{uj, X) achieves its unique global maximum over cj, A 
saitsfying (60) for (uj,\) = [1/2,1/2). And ^{1/2,1/1) = 2^'^. 

Wehave V(l/2, 1/2) = !f (1/2, 1/2, s, s, 1, s) = 2i-T. 

As aheady observed in Appendix C we can restrict attention to A < 1/2. 

Lemma 9. For A > 1/2 we have A) < — w, 1 — A). 

Proof : 

Let A = l — A<l/2 and uj = 1 — uj. Let a, 6, c be such that ^/'(a). A) = ^{oj., A, a, 6, c, s) 



Then c < 1 as P(l) = -fc (cf. (61)) 
tf'i(a;, 6, a, s) = i^'i(a), a, 6, s). 




fc 



Now, Lemma 8 implies the claim. 



□ 



Definition 3. For a > 0, s > and < c < 1 we define 



ri(a,s) = 1 + 




PPLUS (c, Q(s)) 



c, s) = A(a, s) • r2(a, c, s 



Next the key proposition. 



Proposition 5. 



Let s>2. For Q < P < 1 there exist < a, c < 1 with P = ac, 
such that r(a, c, s) < 2, equality only for P — or P — 1. 
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Proof of Theorem 4 from Proposition 5. 



With s = s{k, 7) for b and as with a > for a in Definition 1 
we have tp{Ld, A) < '1^{uj, X, as, s, c, s) ( Lemma 8 .) 

Mu;,as,s,s) = { ( . X % 1 + ^ = r,{a,s) (AGM inequality ) 

\q{s)ojJ \{1 - uj) ■ q{s) J q{s) 

API 
P := -. Then A = , 1 - A = . Let ac = P. 

1-A 1 + P ' 1 + P 



1 + ac/ Vl + ac/ y Vl + ac' 

The definition of tf' requires if'2 therefore: p{c) = 

= \ {{l + cf^'-^ + (1 - cf'''^) < I ((1 + c)^-^ + (1 - cpf' = \ (PPLUS(c,fc7))^/^ (63) 

Note x\+ x\ < {xi + x^Y for 2/ > 1, x\, X2 > 0, equahty only for y — lox one of the Xi = 0. 

By Q{s) = k'y (62) and (63) implies 

/ -, \ Qis) 

i'F.iX, as, s,c,s)r < T— (p(c)r < 
\l + acj 



For 0<A<l/2we have < P < 1 and Proposition 5 applies. With a, c which satisfy Proposition 5 we 
have Theorem 4 (using Lemma 9.) Note that the bound 2^^'' is only reached for a; = 1/2 and P = 1 that is 
A = 1/2. (The case P ~ need not be considered for Theorem 4.) □. 



D.2 Proof of Proposition 5 

Remark 1. We prove Proposition 5 only for s > 6. We have (5(6) = 6.09 . . . and the proof covers all 
> 6.09 .... Reading the details it should be clear that we can also find a proof for 6 > s > 2. Some 
additional, purely technical effort seems unavoidable for this. 

We usually write Q instead of Q{s) (cf. (59.)) 

We need to consider P(a, c, s) for < a, c < 1. It has the following properties. 

Lemma 10. (a) P(0, 0, s) = 2, P(0, 1, s) = 2'3(''), P(l, 0, s) = 2 • 2 = 4, P(l, 1, s) = 2 

(b) For each < c < 1 P(a, c) has only one extremum in a. It is a minimum and < a^/^ < 1. If c = 
then ayyiiy^ ~Oifc~l then a^^^j^ ~ 1. 

fcj For each < a < 1 P(a, c) has only one extremum in c. It is a minimum and < c^/^ < 1. If a = 
'f^^" (^min = i/a = 1 then Cj^in = 1- 
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Proof : (b) 



(c) 



Proofof(65) 



s(cxp(sa) — 1) 

^{lnria,c,s)) = - Q ■ >=< 

da 1 (a, s) 1 + ac 

exp(sa) — 1 

exp(s)-i >=< (Division with Q(s)) 

Ji(a, s) 1 + ac 

>=< c with K ^ ■ / — -, L= ^ ' 



1 + L — aK exp(s) — 1 ' exp(s) — s — 1 

K 

IS strictly increasing in < a < 1 

1 + L — aK 

K K 
Moreover — — 0, — = 1 

1 + L - aK |a=o 1 + L - aK \a=i 



d n T^, ^^ ^ « ,^ PMINUS(c, Q - 1) 

ac I + ac PPLUS(c, (y) 

PMINUS(c,Q - 1) a 

>=< 



PPLUS (c,g) l + ac 

^ PMINUS(c, g - 1) • (1 + ac) >=< a • PPLUS(c, Q) 
PMINUS(c, g - 1) >=< a • (PPLUS(c, Q) - c ■ PMINUS(c, Q - 1)) 

PMINUS(c,g- 1) 



>=< a (64) 

rrLUvS^c. — i j 

PMINUS(c,g - 1) 

We calculate below — — is strictly increasing in < c < 1. (65) 

PPLUSfc, Ij 



ppLus(c,g - 1) 

:tly increasing in C 
PMINUS(c, g - 1) PMINUS(c, Q - I) 



Moreover , , „, 

ppLus(c,g-i) ppLus(c,g-i) i^^^ 



d PMINUS(c,g) 4(l-c2)Q-i 



dc ppLus(c,g) (ppLus(c,g))2 



Some experimentation reveals that r{a, c, s) < 2 only for a, c as follows: 



(66) 

□ 



- An area like 1 > a > 1 — e and all 1 > c > e. e decreasing in s. The strip becomes narrower when 
c-> 1. 

- An area like < c < e and all a < 1 — e. The strip becomes narrower when c — > 0. 

- Observe that for each < P < 1 we can find an a, c with P = ac in the area described. 

Proposition 5 follows from the following three lemmas. First, we first single out 4 argument pairs (a, c) 
for which we can bound P(a, c, s) < 2. 

Lemma 11. There is a constant B < 2 such that P(a, c, s) < B for: 
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(a) a = 1 - l/Q, c = l/Q and s > 4.2. 

(b) a = l-l/Q, c = U/20 and s> 5. 

(c) a = 1/2, c=l/Qands>4:.8. 

(d) a = 1/2, c= 1/(2Q) and s > 3.0. 

The next lemma deals with pairs (a, c) with ac in the neighbourhood of 
Lemma 12. Let s > 4.2 and A = A{c, s) = c - Q. Then 



Proof of Proposition 5 for s > 6 from the preceding three lemmas. The lemmas capture all P = ac : 

- Lemma 12 captures < P < {1/2Q)Q{1/2Q) = 1/(4Q). 

- Lemma 11 (d) and (c) with Lemma 10 (b) capture 1/(4Q) <P< (1/2) (l/Q) 

- Lemma 11 (c) and (a) with Lemma 10 (c) capture (1/2) ((l/Q) < P < (1 - l/Q) (l/Q) 

- Lemma 11 (a) and (b) capture (1 - 1/Q)(1/Q) < P < (1 - l/Q)(ll/20). 

- Lemma 13 captures (85/(lG0Q))(2/5) + 1 - 85/(100Q))(2/5) < P < 1. 

As (-(3/5)(85/(lG0Q)) + 1)2/5 < 11/20 - 11/(20Q) <^ Q > 346/150 = 2.3 . . . all P are captured. 
D.3 Proof of Lemma 1 1 

Lemma 11 (repeated) There is a constant B < 2 such that P(a, c, s) < B for: 

(a) a = 1 - l/Q, c = l/Q and s > 4.2. 

(b) a = l-l/Q, c = 11/20 and s> 5. 

(c) a = 1/2, c=l/Q and s>A.?>. 

(d) a = 1/2, c=l/{2Q)ands>3.0... 

Proof : (a) We have 



r(A, c, s) < 2 for < c < 1/ (2Q), with equality only for c = 0. 



The next lemma treats ac in the neighbourhood of 1 . 



Lemma 13. Let s > 6 and A = A{c, s) = 85/(100Q) • c + 1 - 85/(100Q). Then 



r{A, c, s) < 2 for 2/5 < c < 1, with equality only for c = 1. 



Pi(l-1/Q, ,s) 



1 + 



cxp((l-l/Q)5)-(l-l/Q)s~l 
exp(s) — s — 1 




Further below we show: 



For < a < 1 



exp(as) — as — 1 



< 



exp(as) — 1 



< 



exp(as) 



, equahty only if a = 1 or a = (67) 



exp(s) — s — 1 



exp(s) — 1 



exp(s) 
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Therefore 

ri(l - l/Q, s) <1 + ''''^^^l^'^J^'^^'^ = 1 +cxp(-,s/Q). (68) 
Concerning /2(1 — l/Q, l/Q, s) we observe: 

Q'-Q ^ f -2Q + 1 \^ ^ / (2Q-i)g 

Altogether we have 



r{l - l/Q, l/Q, s)|,=4.2 < (1 + exp(-s/g)) " ^ + exp 



Q^+Q-lJ "V Q2+Q-1 



|s=4.2 

= 1.9829.(69) 



lim r(l - l/Q, 1/Q,s) = lim (1 + exp(-.s/Q)) I ( + exp ^ ^ 



= (1 + cxp(-l))(l + cxp(-2)) « 1.55. 



We show that the function in (69) is decreasing in s > 4.2. 
We consider the factors of (69) separately. 

l + cxp(-s/Q) = 1 + cxp 



exp(s) — s — 1 
cxp(s) — 1 



exp(s) -s-1 s s 

— — = 1 — and — IS decreasing . 

exp(sj — 1 exp(s) — 1 exp(s) — i 

Therefore the leftmost factor in (69) , 1 + exp(— s/Q), is decreasing. (70) 
We come to the second factor of (69.) Because of (56) we can consider Q as an independent argument. 

d , f Q^ + Q 

ln(g2 + Q) - ln(Q2 + Q-1 ^'^^ 



dQ \Q^ + Q-1, 
Q{2Q + 1) Q(2g + 1) 



Q^ + Q Q^ + Q-1 
_ 1 g(2Q + l) _ g(2Q + l) 
^ Q^ + Q Q^ + Q-1 

for an + g — 1 < X < Q'^ + Q by the Mean Value Theorem. 

The preceding term is < by direct calculation with x — Q^ + Q — 1. 



2Q2 _ Q 

is increasing by for Q > 1 by simple differentiation. 



Q2 + Q-1 



2Q^ ~Q , 
Therefore exp — is decreasing. 

* Q^ + Q-lJ 
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Proof of (67 .) For a = and a = 1 the first two terms are equal. 

(exp(as) — as — l)(exp(s) — 1) < (exp(as) — l)(exp(s) — s — 1) 

— as(exp(s) — 1) < — s(exp(as) — 1) 
a(exp(s) — 1) > cxp(as) — 1 which holds by convexity for < a < 1. 

The first inequality is shown. 

For a = 1 the terms of the second inequality are equal. Using a < 1 a simple computation shows the 
required inequality. 



r2(l - l/Q, 11/20, s) 



(b) 

We have ri(l-l/Q,s) < 1 + cxp(-s/g) (by (68)) 
31 \^ / 9 " 

^si-ii/qJ ^yn-ii/Q^ 

r(l- l/Q, 11/20, s)|,^5 < (1 + cxp(-s/Q))r2(l-l/Q, 11/20, s)|,=5 = 1.9971 ... (71) 
And lim (1 + exp(-s/g))r2(l - l/Q, 11/20, s) = (1 + exp(-l)) exp(ll/31) = 1.9505... 

s— >-oo 

We show that the function in (71) is decreasing in s. The first factor is decreasing as seen in (70.) 
The two additive terms of /2(1 — l/Q, 11/20, s) are considered separately. 

31 \^ 1 



31-11/Q; (1- 11/(31Q))Q 

ln(l-ll/(31Q))« - «(-3T^ - \ {io) - \ {ijo) - •••)(Logar>thmsenes) 

This implies by termwise differentiation ln(l — 11/(31Q))'^ > for Q > 1. 

dQ 

(1 - ii/(3iQ))^ '^'"'''''"^ ■ ^^^^ 

The last term. 



(31-110); V(i-ii/(3i<3))-3i/9 

is the product of two decreasing terms by (72.) 

(c) We show below the following statements (73) and (74.) 

ri{a, s) is decreasing in s for < a < 1. (73) 
/2Q + 2\'^ /2Q-2' ^ 



r.(i/2,i/Q,.)^^2^^^^ ' V2Q + 1, 
1 + 1/Q , f 1-1/Q \'^. . . . 

//r)^^ + . . . IS mcreasmg m s. (74) 



1 + 1/(2Q); Vl + 1/(2Q)/ 
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We have 

lim r2(l/Q, 1/2, s) = exp(l/2) + Gxp(-3/2) = 1.8718... 

S^OO 

With (74) we get r(l/2, 1/Q, s)|,=4.8 - ^(1/2, 5)r2(l/2, 1/Q, s)|,=4.8 
< ri(l/2,s)(exp(l/2) + exp(-3/2))|,^4.8 = 1.9951... 

With (73) the preceding inequality implies the claim. 

We prove (74.) The additive terms of r2{l/Q, 1/2, s) are treated independently. 

= ln(a; + 1) - ln(a;) + - with x = 2Q + 1. 

a; + 1 X 

, , , X — 1 X — 1 1 X — 1 X — 1 

Hx + 1) - ln(x) + — — = - + — 

a; + l X y X + 1 x 

for anx<y<x + l with the Mean Value Theorem. 
The very last expression for y^x + lis>0 and ((2Q + 2)/(2Q + 1))'' is increasing. 



I -1/Q Y = o(-- f-V - i 
l + l/(2Q)y' ^[ Q 2\Qj S\Q, 



1 1 / 1 \^ 1/1 



Termwise differentiation shows that In ( , | > 0. 

dQ Vl + 1/(2(3)/ 

And ((2Q - 2)/(2Q + 1))^ is increasing, (74) is shown. 
Proofof(73.) 

d ( exp(sa) — sa — 1\ a(exp(sa) — 1) exp(s) — 1 

— In -z— = — r < 

as \ exp(s) — s — 1 / exp(sa) — sa — 1 exp(s) — s — 1 

a(exp(s) - s - 1) < (exp(s) - i)!^^pM_^ ^ ( (,) (75) 

exp(sa) — 1 \ exp(sa) — 1/ 

For a = both sides of (75) are 0, for a = 1 both sides are exp(s) — s — 1. Given s > 0, the right-hand- side 
is concave in < a < 1 and (75) holds for all < a < 1 . 

(d) We show below (76) 

'4Q + 2\'^ /4g- 



^2(1/2, i/(2g), 



l + l/(2g) V , / 1-1/(2Q) \'°^ 
l + l/(4g); ^Vl + 1/(4Q); 



4Q + i; V4Q + 1. 

Q 

increasing in s. (76) 
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We have 



lim 1^2(1/2, 1/(2Q), s) = exp(l/4) + exp(-3/4) = 1.7563... 

With(76)wegetr(l/2, 1/(2Q), s)|,^3.o = ^(1/2, ,s)r2(l/2, 1/Q, s)|,^3.o 
< A(l/2,s)(exp(l/4) + cxp(-3/4))|,^3.o - 1.972... 

With (73) the preceding inequahty imphes the claim. 

To prove (76) we proceed as in the proof of (74). The (4Q + 2)/(4(5 + l)-term is treated as {2Q + 
2)/(2Q + 1) only with x = 4(3 + 1. And the (1 - l/(2(5))/(l + l/4Q)-term is treated with the logarithm 
series. □ 



D.4 Proof of Lemma 12 

Lemma 12 (repeated) Let s > 4.2 and A = A{c, s) = c - Q. Then 

r{A^ c, s) < 2 for < c < 1/ (2(5), with equality only for c — 0. 



Proof : We consider c as a function of a : Let C — C{a,s) = a/Q. Then < C{a,s) < 1/(2(5) iff 
< a < 1/2. The claim of the lemma is equivalent to: 

For s > 4.2 r(a, (7, s) < 2 for all < a < 1 /2, with equality only for a ^ 0. (77) 

We show that r{a,C,s) is strictly decreasing in < a < 1/2. As (7(0, s) = and r(0,0, s) = 2 by 
Lemma 10(a) we get (77.) 



sfexp(sa) — 1) 

i^i„r(„.c..) = ^fp^ ^ Q-il^ + O-^ TluTc^oT" >-<° 

da Ii[a,s) I + aC Q PPLUS(G,(5j 

I^SfT 2a/Q 1 PMINUS(C,Q-1) „ .t.- • • -u^^ /^n ^no. 

r7 X - T-r^ + n PPTiTQ^A n^ ^ (Division with Q, C = a/Q). (78) 

li[a,s) 1 + aC Q PPLUS(G,(5) 

With a — derivative is 0. 



We split the left-hand- side of (78) into two additive terms. The following two inequalities directly imply 
that r{a, C, s) is decreasing. 



cxp(sa)-l 

cxp(s)-i a/(5 



ri{a,s) 1 + aC 
a/Q 1 PMINUS((7,Q- 1; 
1 + aC ^ Q PPLUS (C,Q) 



< (79) 

< (80) 



32 



Proof of (80) for < a < 1 and s >2. With the calculation leading to (64) we have: 

a/Q 1 PMINUS(C,Q- 1) 



>=< 



l + aC Q PPLUS (C,g) 
PMINUS(C, g - 1) >=< a ■ PPLUS(C, Q - 1) 



\l-a/QJ I- a 



1 / \ 3 1 / \ 5 

a 1 a \ l a 



(Logarithm series , a < 1, Q > (9(2) > 2) 

Here, the <-case of the last inequaUty holds because [Q — \){\/k){a/Q)^ < jk as Q > Q{2) > 2. 
Inequality (80) is proved. 

Proof of (79) for < a < 1/2 and s > 4.2. We prove further below: 

exp(sa) — 1 exp(sa) — sa — 1 
a ^\ ' < ^\ for < a < 1, equality only for a = 1, a = 0. (81) 

exp(s) — 1 exp(s) — s — 1 

J-,. ^ exp(sa) - 1 ,^ ^, , exp(sa) - sa - 1 

Abbreviating K = K{a, s) = — ^V^^ ™d ^ = L{a^ ■s) = ^ ^ ' 

exp(s) — 1 cxp(s) — s — 1 

we have A (a, s) = 1 + L. Then (79) becomes 

K a/Q 

< 



1 + L 1 + aC 
K- [l + a-^'j < ^-(l + L) (C^a/Q) 

^ K < ^-{l + L-a-K) (82) 



With (81) inequality (82) follows from 



a s{exp{sa) - 1) 

A < — <^=^ < a. (83) 

Q exp(s) — s — 1 

To show the right-hand-side of (83) for < a < 1/2 it is sufficient to show it for a = 1/2. This as both 
sides of the inequality are for a — and its left-hand-side is convex in a. We fix a — 1/2 from now on. 
The right-hand-side of (83) is equivalent to 

sexp(sa) — aexp(s) < s — a{s + 1) = (1 — a)s — a. (84) 
With s = 4.2 (and a = 1/2) we get 0.95 • • • < 1.6. 

To get (84) for s > 4.2 we observe that its right-hand-side is increasing in s. We show that the left-hand-side 
is decreasing in s. 

— (sexp(sa) — aexp(s)) — sa exp(sa) + exp(sa) — a exp(s) < 
as 

■^=> 1 + sa < aexp{s{l - a)) (85) 
With s = 4.2 we get 3.1 < 4.8. 
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As the derivative with respect to s of the right-hand- side of (85) is > 1/2 for s — 4.2 and increasing, in- 
equahty (85) holds for all s > 4.2 finishing the argument. 

Proof of (81 ). For o = 0, a = 1 the claim holds. For < a < 1 it is equivalent to 

exp(s) — 1 / sa 
a < 1 



exp(s) — s — 1 \ exp(sa) — 1 
This inequality holds because its right-hand-side is concave in a. 



□ 



D.5 Proof of Lemma 13 

Lemma 13 (repeated) Let s > 6 and A = A{c, s) ^ 85/(100Q) • c + 1 - 85/(100Q). Then 
r{A, c, s) < 2 for 2/5 < c < 1, with equality only for c = 1. 

Proof : We show that r{A, c, s) is strictly increasing for 2/5 < c < 1. As A{1, s) — 1 and r{l, l,s) = 2 
by 10(a) this implies the lemma. 

We abbreviate A' = ^A{c, s) 



dc ' ' ' loog 



cxp(yls) — 1 A / , , , 

cxp(.)-i _ A'c + A PMINUS(c,g- 1) 
ri(A,s) l + Ac PPLUS (c,Q) 



> < (Division with Q.) (86) 



A' A' + 1 1 
Observe that c = 1 in (86) gives — 2 ^ 



The following inequalites imply that r{A, c, s) is increasing. 

^^Pi'^^)-'^ - A' cA 



^TAc > ° ^^^^ 
A , PMINUS(c,Q-l)^^^_ ^^^^ 



1 + Ac PPLUS (c,Q) 

Proof of (88) for s > 4 and 2/5 < c < 1. We show further below: 

PMINUS(c,(3 - 1) 

Given s > 2 -^^^ is concave for < c < 1 (89) 



By (64) inequality (88) is equivalent to: 

PMINUS(c,Q- 1) 



>A{c,s). (90) 



PPLUS(c,Q - 1) 

For c = 1 both sides of (90) are 1. (91) 
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We show that (90) holds for c = 2/5 and s > 4.0. Then, by (89), it holds for all 2/5 < c < 1 and we have 
(88.) 

PMINUS (2/5,Q- 1) _ T'^-i -S'^-i _ S'^-i ..,r,,r n 51 

PPLUS (2/5, Q-1) " ^ 7^-1 + 3^-1 ^"^^^(2/5,5) - ^-^OOQ- 

With Q = 4.3 the right-hand-side of (92) becomes 8.43 . . . < 8.69 .... 

Considering the derivative shows that (92) holds for all Q > 4.3. Q = Q{s) is increasing in s (cf. (56).) We 
have Q(4) = 4.32 and inequality (92) holds for s > 4. 

Proof of (89.) The derivative of the fraction is calculated in (66). Considering the numerator and denominator 
separately one sees that it is decreasing in c. 

Proof of (87) for s > 6 and , < c < 1. We abbreviate 

T^/ A N exp(As) - 1) T / A ^ exp{As) - As - 1 

K = K(A, s) = ^ , and L = Li A, s) = — ^ , [ 

^ ' ' cxp(s) - 1 ^ ' ' cxp(s) - s - 1 

We have Fi {A, s) ~1 + L. We divide with A' and (87) is equivalent to 

^ ,^ , = > — -A — ^ K{1 + Ac) -Loc. (93) 

ri{A,s) l + il + A-c ^ ^ 

As j4 < 1 we have if > i by (67) and (93) follows from 

K{1 + Ac) ~Kc ^ K{1 + Ac-c) > c. (94) 

As(^-l)c>^-l(byA< l,c< 1) inequahty (94) follows from 

K -{l + A-l) = K ■A>c. (95) 

We need to show (95) for < c < 1. This becomes easier when we consider c as a function of a. The 
inverse function of A{c, s) is 

C = C{a, s) = lOOg/85 • a + 1 - lOOQ/85. 

Then (95) for < c < 1 is equivalent to ^^P(^") ~ ^ ^ > C{a, s) for 1 - 85/(100Q) < a < 1 

exp(s) — 1 

We show (exp(as) - 1) • a > C{a, s) ■ (exp(s) - 1). (96) 
For a = 1 both sides of (96) are equal to exp(s) — 1. 

The left-hand-side of (96) is convex. Therefore (96) follows from 
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^(exp(as)-l).a^^ < (^^C{a, s) ■ {exp{s) ~ 1)^ ^ . (97) 

— — (exp(as) — 1) • a = saexp(as) + exp(as) — 1 = (1 + sa) exp(as) — 1. 
da 

'-da, s) ■ (exp(.) - 1) - 100Q(-P(^) " D - 100.(exp(.) - 1)^ 



da ^ ' ^ ' ' ' 85 85(exp(s) - s - 1) 

100s(exp(s) - 1) 



Inequality (97) follows from (1 + s) exp(s) — 1 < 



85 



l<exp(.)('i|.-l). (98) 



85 ^ ' V85 

With s = 6 inequality (98) becomes 6.0 . . . < 23.7 . . . and then it holds for all s > 6 



□ 
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